Calculating structural differences from binary differences in publish subscribe system

ABSTRACT

A method for more efficient structural parsing of binary representations of text based objects within a data distribution system. Clients subscribe to a topic maintained by the data distribution system server that publishers can publish to. Clients receive an original binary representation of a text based object describing the state of the topic to which the client is subscribed. In response to the state of the topic changing at the data distribution system server, clients receive a binary delta representing the change of the state of the topic. Based on the received binary representation and the binary delta, clients calculate an updated binary representation of the text based object. Using the original binary representation, the updated binary representation, and the binary delta, the client generates a structural delta representing the structural differences between data structures of the original text based object and data structures of the updated text based object.

CROSS REFERENCE TO RELATED APPLICATIONS

This application is a continuation of U.S. application Ser. No.16/087,999 filed Sep. 24, 2018 titled “Calculating StructuralDifferences From binary Differences in Publish Subscribe System” whichis a 371 application of, and claims the benefit to, InternationalApplication No. PCT/IB2017/000397 filed Mar. 27, 2017 titled“Calculating Structural Differences From Binary Differences in PublishSubscribe System” and claims the benefit of U.S. Provisional ApplicationNo. 62/314,642 titled “Efficient Broadcast Using Binary Delta Streams”filed Mar. 29, 2016 and U.S. Provisional Application No. 62/315,170titled “Calculating Structural Differences From Binary Differences”filed Mar. 30, 2016, both of which are incorporated herein by referencein their entirety.

FIELD OF ART

This disclosure generally relates to the field of data distribution, andmore specifically, to publishing updates to topic information tosubscribers of the topic information.

BACKGROUND

The increased demand for data means that business systems andapplications must exchange data efficiently and intelligently at scalewith devices, browsers, and other applications over the Internet. Tomeet this increased demand for data, some data distribution platformsemploy a publish-subscribe model in which senders of messages, calledpublishers, publish messages into classes (e.g., topics) withoutknowledge of subscribers who may receive the messages. Subscribers in atopic-based publish-subscribe system will receive all messages publishedto the topics to which they subscribe, and all subscribers to a topicwill receive the same messages. Publishers establish a session with theserver to create and maintain topics and clients establish a sessionwith the server to consume data published by the publishers. Some topicshave values that change incrementally. For example, a topic with a valuerepresenting a person may be updated to change the email address,without affecting other fields such as the contact address. For such atopic, it is more efficient to notify clients of the differences betweenthe old and the new value, rather than providing the complete new value.

SUMMARY

In one embodiment a method of efficient structural parsing is described.A plurality of topics is maintained by a data distribution systemserver. A client subscribes to a topic of the plurality of topics. Afirst binary representation of a first text based object which describesa first state of the topic is received. In response to a change in thetopic being published to the distribution system server by a publisher,a binary delta from the data distribution system server is received. Thebinary delta represents a difference between the first binaryrepresentation of the topic and a second binary representation of asecond text based object describing a second state of the topic. Astructural delta from the binary delta is generated representing astructural difference between data structures of the first text basedobject and data structures of the second text based object.

In one embodiment, the second binary representation is calculated fromthe binary delta and the first binary value and the structural delta isgenerated from the first binary representation, the second binaryrepresentation and the binary delta. A first set of tokens indicating astructure type is determined from the first binary representation and asecond set of tokens indicating a structure type is determined from thesecond binary representation. At least one of the first set of tokens isparsed by an old span parser and at least one of the second set oftokens is parsed by the new span parser to determine the structuraldelta. Elements of the second text based object are displayedcorresponding to the structural difference of the structural delta.

In one embodiment, the structural delta includes at least one removestructure edit operation that indicates a portion of the first textbased object to be removed. Alternatively or additionally, thestructural delta includes at least one insert structural edit thatindicates a value to be added to the first text based object.

In one embodiment, the number of tokens parsed by at least one of theold span parser and new span parser is based on bytes in the binarydelta.

In one embodiment, determining the structural delta is based at least inpart on a first set of bytes associated with tokens parsed by the oldspan parser, a second set of bytes associated with tokens parsed by thenew span parser, the position of the byte in the first binaryrepresentation associated with the parsed tokens, the position of thebyte in the second binary representation associated with the parsedtokens, and the structure type of the parsed tokens.

In one embodiment, the last parsed token of the first set of tokensadvances the old span parser such that the old span parser is misalignedwith the last parsed token of the second set of tokens and the new spanparser. In response to the misalignment, at least one structural edit isgenerated for the structural delta.

In one embodiment, the last parsed token of the second set of tokensadvances the new span parser such that the new span parser is misalignedwith the last parsed token of the first set of tokens and the old spanparser. In response to the misalignment, at least one structural edit isgenerated for the structural delta.

In one embodiment to generate the structural delta, a plurality ofindividual structural edit operations are added to the structural deltaand false positives are removed from the added structural editsoperation. In one embodiment, a non-transitory computer readable mediumstores instructions for efficient structural parsing. The instructionsare executed by a processor and cause the processor to implement themethod described herein.

BRIEF DESCRIPTION OF DRAWINGS

FIG. 1 is a block diagram of a data distribution system, according toone embodiment.

FIG. 2 is a block diagram illustrating components of the datadistribution system server, from FIG. 1 , according to one embodiment.

FIG. 3 is an interaction diagram illustrating the interactions between aprovider client, clients, and the data distribution system server,according to one embodiment.

FIG. 4 is a flow chart illustrating a method for continuous transmissionof binary deltas to subscribed client devices, according to one exampleembodiment.

FIG. 5 is a flowchart illustrating a method for continuously receivingof binary deltas, converting binary deltas to new topic values, andcalculating structural deltas on a client, according to one exampleembodiment.

FIG. 6 is an illustration of the calculation of a binary delta from anold topic value and a new topic value, according to one exampleembodiment.

FIG. 7 is a flowchart further illustrating a method for calculatingstructural deltas on a client, according to one example embodiment.

FIG. 8A is an illustration of a token parser, according to one exampleembodiment.

FIG. 8B is an illustration of a span parser, according to one exampleembodiment.

FIG. 8C-8E are illustrations of split cases when generating structuraldeltas, according to one embodiment.

FIG. 9 is a schematic diagram of a computing device for implementing aserver or client device, according to one embodiment.

The figures depict embodiments for purposes of illustration only. Oneskilled in the art will readily recognize from the following descriptionthat alternative embodiments of the structures and methods illustratedherein may be employed without departing from the principles of theinvention described herein.

DETAILED DESCRIPTION

I. Data Distribution System Architecture

FIG. 1 is a block diagram of a data distribution system 100, accordingto one embodiment. The data distribution system 100 includes a datadistribution system server 110, external systems 102, and client devices104.

One or more external systems 102 interact with the data distributionsystem server 110 to distribute data to multiple client applicationsover a network. An external system 102 may be a server associated with adata source for distribution via the data distribution system server110. Example data sources include entities such as a stock exchange, anonline game provider, a media outlet, or other source that distributestopical data to users over a network, such as the Internet.

The external system 102 communicates with the data distribution systemserver via a hosted application called a “publisher 114,” which enablesthe external system to create and maintain topics on the datadistribution system server 110 for distribution to multiple clients 104a-c. Alternatively, publishers 114 may operate as a separate processexternal to the data distribution system server 110, in which case thepublisher 114 is referred to as a provider client 108. For example, thepublisher 114 may be located within one of the external systems 102instead of the data distribution system server.

The client devices 104 communicate with the data distribution systemserver 110 through a network 180. The client devices include clients106. A client 106 can be an application that communicates with the datadistribution system server 110 using one or more specified clientprotocols. Example client protocols include WebSocket (WS) and HypertextTransfer Protocol (HTTP). Some clients connect to the data distributionsystem server to subscribe to topics and receive message data on thosetopics. Other clients 106, which have different permissions, performcontrol actions such as creating and updating topics or handling events(e.g. control client 106 d). The category of client depends on thelanguage of the application programming interface (API) and librariesused to implement it. Example APIs include JavaScript Unified API, JavaUnified API, .NET Unified API, C Unified API, iOS Classic API, andAndroid Classic API. Example client libraries include Flex andJavaScript.

The clients 106 can include web clients 106 a, mobile clients 106 b, andenterprise clients 106 c. Web clients 106 a can include browserapplications that use JavaScript, ActionScript, or Silverlight APIs.Enterprise clients 106 b may be any application connecting to the datadistribution system server 110 over a data distribution system serverprotocol for Transmission Control Protocol (TCP) over the Internet or anintranet, extranet using Java, .Net, or C APIs. Mobile clients 106 c maybe mobile applications that interact with the data distribution systemserver 110 using iOS or Android APIs.

Generally, clients 106 interact with the data distribution system server110 using an API 190. The API 190 may include the libraries appropriateto the platform executing the client application. The category of client106 depends on the language of the API and libraries used to implementit. Clients 106 may be implemented in one of a number of languages anduse variety of protocols to communicate with the server 110. Clients 106may perform different types of actions depending on their permissionsand the capabilities of the API they use.

Clients 106 used by data consumers typically subscribe to topics andreceive the updates that are published to these topics from the datadistribution system server 110. Clients used by data providers(hereafter, provider clients 108) typically create, manage, and updatetopics. These provider clients 108 also take responsibility for controlfunctions, for example authenticating and managing other clientsessions. Provider clients 108 may be a client within the externalsystem 102.

Additionally, clients 106 can include a settings module 130, aconversion module 132, and a user interface 134. The settings module 130can include client settings governing connections from the client 106 tothe data distribution system server 110. The settings module 130 canmanage and set permissions for actions that the client 106 or datadistribution system server 110 can take when connected to one anothervia the network 180. Further, the settings module 130 can manage towhich topics the client 106 is allowed to subscribe or which pushnotifications the client can receive from the data distribution systemserver 110. The conversion module 132 performs operations on the datareceived by the client. Example operations include structuralconflation, merging, calculating differences in topic values,constructing updated topic values from topic deltas, and calculatingstructural deltas. The user interface 134, e.g. a graphical userinterface, allows a user of the client device to interact with theclient 106. In particular, the user interface 134 is configured todisplay updated structures of subscribed topics as the client 106receives binary deltas from the data distribution system server 110 andcalculates structural deltas.

The data distribution system server 110 hosts publisher applications 114and a topic tree 116, manages connections from clients 106, sessionsfrom clients 106, and pushes data to the clients 106 through messagequeues. The data distribution system server 110 may be a standaloneserver or part of a cluster of servers to provide a scalable enterprisedata distribution solution. The data distribution system server 110pushes (streams) and receives data and events, in real-time, both to andfrom clients 106. The data distribution system server 110 includes ahigh performance network layer module 124, security enforcement module122, client session module 120, topic tree 116, data management module118, publishers 114, and a management console 112.

The high performance network layer 124 handles a high number ofconcurrent connections without the need for separate threads. Connectorshandle connections from many different types of clients 106 and forvarious protocols. Connectors may be configured to listen on differentports. Multiple clients 106 may connect to a single port.

The security enforcement module 122 authenticates all connections fromclients and manages authorization and setting permissions for actionsthat those clients 106 can take when they are connected to the datadistribution system sever 110.

The client sessions module 120 manages the sessions for all of theclients 106 that connect to the data distribution system server system110. Additionally, the client session module 120 stores informationabout each client 106 and topic subscription information about theclient's subscriptions to topics, such as by storing a list of topicselections. If a client 106 disconnects, it can reconnect to the samesession within a specified time period using the information stored inthe client session module 120.

In one embodiment, the client sessions module 120 manages which clientsare ‘in-session’ or ‘out-of-session.’ ‘In-session’ clients are clientsthat are connected to the data distribution system server, and‘out-of-session’ clients are clients that are disconnected from the datadistribution system server. In this embodiment, ‘in-session’ clients areclients that automatically receive pushes of updated topic tree 116values from the publisher 114, and ‘out-of-session’ clients are clientsthat do not automatically receive pushes of updated topic tree 116values from the publisher 114. Generally, clients that are subscribed toa topic are automatically in-session for the subscribed topic.

The data management module 118 performs operations on the data to moreefficiently deliver it to clients 106. Example operations includestructural conflation, merging, calculating differences in topic values,and replacing data to ensure that the latest data is received by theclient 106.

The management console module 112 may operate as an optional publisherthat is deployed by default. The management console module 112 may beused to monitor the operations of the data distribution system server110 through a web browser and to stop and start publishers within thedata distribution system server 110.

Publishers 114 can be components hosted within the data distributionsystem server 110 that manage the data for one or more topics andpublish messages to any clients 106 that subscribe to the topics thatthe publisher 114 manages. In one example, publishers are written usingthe Java API and extend the issued Publisher class and implement variousmethods to provide the publisher functionality. A publisher 114maintains its own data model. The publisher initializes its data as itstarts and updates it as a result of external events, (e.g. an externalsystem 102 changing the value of a topic within the topic tree 116).When a client 106 first subscribes to a topic the publisher 114 providesthe client 106 with a snapshot of the current state of the data relatingto that topic (e.g. transmitting the current topic value from the topictree 116 to the subscribing client 106). This is referred to as a “topicload.” A client 106 can also request the current state of a topic, evenif not subscribed to it, using the “fetch” command. Alternatively, thepublisher 114 can also topic load a client device in response to aclient becoming in-session (e.g. the client 106 establishes a highquality connection to the data distribution system server 110).

A publisher 114 maintains any changes to its topic data state andpublishes those changes to the topic as delta messages that includebinary deltas. In one embodiment, this results in the message being sentto every client 106 that is subscribed to the topic. Publishers 114 cansend messages to individual clients 106 (i.e. notify clients) or togroups of clients and can receive messages from clients 106. Undercertain operating conditions, the publisher 114 does not need to know orkeep track of the clients 106 subscribed to its topics. Publishers 114own the topics they create. Ownership of a topic is used to determinewhich publisher receives a message from a client, deals withsubscription, and, or creates dynamic topics. Publishers 114 hosted inthe data distribution system server 110 may act as client applications106 to other data distribution system servers. A publisher may do thisby subscribing to topics on the other servers to create a distributedarchitecture. Publishers can also store previous topic values and binarydeltas for each topic. The stored topic values can be in any format andcan be accessed at any time to calculate binary deltas. The storedvalues can be associated with a client 106.

II. Topic Trees, Topic Values and Subscription Services

The topic tree 116 represents a model of the organizational structure ofthe topics available to be published to clients and which the clientscan subscribe to. The topic tree is arranged hierarchically andcomprised of top-level topics with subordinate topics underneath thosetop-level topics. These subordinate topics can themselves havesubordinate topics. A topic of any type can be bound to any node of thetopic tree. The topic tree 116 may be maintained by the publisher 114,the client sessions module 120, or by other software within the datadistribution system server 110.

FIG. 2 is a diagram illustrating a topic tree 116, and comprised oflogical connections between Publishers and clients based on topics,according to one embodiment. In one example, topics may be arranged in atree structure. Topic A, B, C and D are the highest level of the treestructure. Topics B and C are beneath topic A in the topic tree andtopic E is beneath topic D in the topic tree in the second level of thetree structure. Topic C is beneath topic B in the third level of thetree structure. Topic D is beneath topic C in the fourth level of thetree structure. The location of the topic in the topic tree is describedby the topic path. The topic path includes all the topics above it inthe topic tree in an order separated by the slash character (/). Forexample, the path to topic B is A, B and the path to topic E is D, E.The topic tree may include any number of topics, and the topic treeshown in FIG. 2 is just one example of a topic tree.

Clients 106 and publishers 114 are loosely coupled through logical linksrepresenting the topics. A publisher 114 publishes messages to a topicand a client 106 subscribes to a topic and receives its messages. Forexample, Publisher 1 can publish to topic A or any topic subordinate totopic A. Publisher 2 can publish to topic D or any topic subordinate totopic D. Publisher 3 can also publish to topic B or C or any topicsubordinate to topic B or C.

Client 1 is subscribed to receive messages from Topic A at path A, topicB at path A/B and Topic C at path C. Client 2 is subscribed to receivemessages from Topic B at path A/B and Topic E at path D/E. Client 3 issubscribed to receive messages from Topic B at path A/B. Client 4 issubscribed to receive messages from Topic E at path D/E. In someembodiments, clients subscribing to a topic at the first-level of thetopic tree can automatically subscribe to all subordinate topics.

A topic path may also be used by a client 106 to send messages to thepublisher 104 that receives messages on that topic path. In someembodiments, the client is not aware of the publisher and aware of thetopic path. In other embodiments, the client is aware of the publishersand the topic paths.

Topics are created in the data distribution system server 110 bypublishers 114. As shown in FIG. 3 , the topic names are A, B, C, D, andE. In one embodiment, each topic name can appear in multiple locationswithin the topic tree. For example, topic D appears one time in thefirst level of the tree, and again in the fourth level of the tree. Inone embodiment, each topic can have a unique name within the datadistribution system server 110.

Each topic in the topic tree has a topic value, which may also bereferred to as a topic data state (i.e. a snapshot of the current stateof data relating to that topic). A text based data object can be used tohold the data state of the topic. The text based data object includeshuman readable text that describes the data state of the topic. In oneembodiment, the text based data object is in the JavaScript ObjectNotation (JSON) data interchange format. JSON is a self-describing,schema-less approach for objects and their related information. EachJSON object includes data structures defined by the JSON format, such asattribute-value pairs and arrays. The state of the topic can be capturedin the data structures of the JSON object. The examples described hereinwill use a JSON object as an example of a text based data object.However in other embodiments, data formats other than JSON are possible.

A binary value representation of the text based data object is typicallystored at the server 110 instead of storing the text based data object.In some embodiments, the text based data object may be stored by thedata management module 118, which then converts the text based dataobject into its binary value representation when communicating withclients. In one embodiment, the binary representation is generated byencoding a JSON object into a concise binary object representation(CBOR) data format. The resulting binary value representation is asequence of individual binary digits.

Further, the client sessions module 120 maintains subscriptioninformation about the topics each client 106 is subscribed to. For eachclient 106 the subscription information includes a list of topicselections. A topic selection includes data that identifies a subset ofthe topic tree 116 and whether the client 106 is subscribed to thatsubset of the topic tree 116. Each topic selection includes a topicselector which is hierarchical wild-card expression that identifies asubset of the topic tree. The data distribution system server 110 usestopic selectors to subscribe client sessions to appropriate topics. Eachtopic selection includes a subscription operation type value indicatingwhether the topic selector is subscribing to or unsubscribing from thetopic tree.

A client session uses subscribe and unsubscribe operations to change itstopic selections. Each operation adds a new topic selector to theclient's topic selections. When a new topic is added by the publisher114, the topic's path is evaluated against every client session's uniquetopic selections.

III. Delta Streams

For a particular topic, initial communication between the datadistribution system server and a client involves transmission of thetopic's value. In some embodiments, the initial communication can occuras a result of a client device subscribing to a topic of the topic tree.If the topic value later changes, the data distribution system server110 and data management module 118 calculates the differences (i.e. adelta) and, if it would require fewer bytes to transmit, sends the deltato the client rather than the new value of the topic. The client 106combines the current on-client topic value with the received delta tocalculate the new value.

For a sequence of partial incremental changes to a value, delta streamsmay greatly reduce the bandwidth and transmission latency costs to sendeach change to a client. Further, if many clients are subscribed to thesame sequence of updates, the server-side cost of calculating a delta isamortized across all of the subscribers.

As an example of the computational and bandwidth reductions of deltastreams, FIG. 3 is an interaction diagram 300 describing a delta streamwithin the data distribution system 100 including a provider client 108,a data distribution system server 110, and clients 106 a and 106 b. Inthis configuration, the client sessions module 120 is configured toprovide updates to clients 106 that are in-session.

To begin, the client 106 a starts a session 302 with the datadistribution system server 110. The client 106 a transmits the startsession request to the data distribution system server 100 via thenetwork 180. In some embodiments, in response to receiving the startsession request the data distribution system server 110 can provide asnapshot of the current topic value of the client 106 a. This functionssimilarly to a fetch.

The provider client 108 updates 304 (i.e. changes) a topic state of atopic within the topic tree 116 via the network 180. The publisher 114can publish the received topic state to the topic tree 116, such as bypublishing the new binary representation to the topic tree 116. The datadistribution system server 110 can, optionally, notify 304 thesubscribed client 106 a that the current topic state of the subscribedtopic has been updated.

To begin a client 106 a subscribes 308 to a topic of the topic tree 116at the data distribution system server 110 via the network 180 bysending a subscription request to the data distribution system server110. Generally, the subscription provides 310 the client with a snapshotof the current topic value of the client.

The provider client 108 incrementally changes the topic value andupdates 312 the topic value by transmitting the current binaryrepresentation of the topic to the data distribution system server 110via the network 180. The data management module 118 calculates thebinary delta between the previous binary value representation of aprevious JSON object and a new binary value representation of the newJSON object. The publisher 114 of can publish the current binaryrepresentation for the topic to the topic tree 116 using the delta. Thedata distribution system server 110 pushes 314 (i.e. transmits) thebinary delta of the subscribed topic to the in-session client 106 a.

At this point, a one to one correspondence between delta calculationsand clients 106 has occurred. To move forward, a second client 106 b (ormultiple clients 106) previously subscribed to the topic transmits astart session 316 request to the data distribution system server 110.Optionally, the data distribution system server provides 318 a snapshotof the current topic state to the client 106 b. Again, this operationcan be similar to a fetch.

The provider client 108 again updates 320 the current topic state of thesubscribed topic to a new topic state which the publisher 114 publishesto the topic tree 116. The data management module 118 calculates asingle binary delta between the current topic state and the new topicstate and pushes 322 and 324 the binary delta to the in-session clients106 a and 106 b. Here, a single binary delta is transmitted orbroadcasted to a large number of devices. The delta broadcast yieldssavings in computational power and bandwidth that scales with the numberof in-session clients.

The first client 106 a transmits an end session 326 request to the datadistribution system server 110. After this, the provider client againupdates 328 the topic state. As the first client 106 a is out-of-sessionit no longer receives an updated value of the topic The datadistribution system server pushes 330 a binary delta to the in-sessionclient 106 b. To conclude, the data distribution system server 100transmits an end session 332 request to the client 106 b.

As described above, in one embodiment, all all deltas are automaticallypushed to the subscribed clients. In this embodiment, snapshots of thecurrent state of a topic are provided when the client subscribes to atopic. Generally, pushes of the topic values only occur to clientdevices that are subscribed and in-session. If a client device istemporarily disconnected the data distribution system server can queuedeltas to push to the subscribed client and automatically pushes thedeltas when the client re-connects.

III.A Binary Delta Streams

The data distribution system server 110 supports a variety of dataformats that can be used for delta broadcasting. Example data formatsinclude JavaScript Object Notation (JSON) data interchange format, andsimple binary values, all of which can benefit from delta streamtransmission. The data distribution system server observes the variousdata formats and defines a binary representation of a value of a topicwithin the topic tree. In an implementation, the data distributionsystem server 110 creates a generalized binary delta stream based oncalculating differences between the observed binary representations,such as between an old binary value representation and a new binaryvalue representation. The calculated differences are encapsulated in acompact form referred to herein as a “binary delta.”

The data distribution system server 110 overcomes the issues of previousimplementations of updating topics by storing, receiving, andtransmitting topic values (i.e. topic state) in their binary form.Binary deltas may be calculated directly from the binary representationof two topic values, without the need for computationally expensivestructural parsing. Similarly, the recipient (e.g. a client 106) of abinary delta may calculate a new value without the need to parse themessage.

III.B Benefits for JSON Transmission

JSON is a popular data format for transmitting, storing, andrepresenting data across networks. However, transmitting JSON objectscan require a large amount of bandwidth to transmit over a network. Asan example of a topic state object, the following document in its JSONformat is expensive to transmit to clients

{“maps”: [“and”, “lists”]// 31 chars

This JSON data object includes a name/value pair that describes the datastate of a topic. “maps” is the name, while “and” and “lists” are thevalues associated with the name.

The bandwidth cost can be due to white space formatting (optionallyremovable at the cost of readability), redundant delimiters, and decimalrepresentation of numbers. Data savings can be achieved (around 20%)through the use of a more efficient binary representation.

The data distribution system server 110 and data management module 118use Concise Binary Object Representation (CBOR) data format to transmita binary representation of a JSON formatted data object to clients.

Binary delta streams provide more significant savings. Suppose a minorchange to the above document is made, changing one of the elements inthe array of the document as follows:

{“maps”: [“or’, “lists”]// 30 chars

To send the changed value would cost around 30 bytes (i.e. nearly thefull JSON document). The binary delta that the data distribution systemserver 110 and data management module 118 calculates for the above JSONvalues is:

// match 0,7; insert 626F72, match 11,8 // 000743626F720B08 // 8 bytes

The eight bytes of the binary delta are less expensive to transmit thanthe JSON document (i.e. JSON object) of the updated topic value. Suchchanges to a field value, but not a field name is typical within JSONobjects. The savings of transmitting binary deltas rather than entireJSON documents become more dramatic when changing a single field valuein larger JSON documents with more than one field.

To illustrate this, FIG. 4 is a process flow for the data distributionsystem server within the data distribution environment of FIG. 1participating in a delta stream similar to FIG. 3 . More specifically,the deltas of FIG. 4 are binary deltas, each binary delta associatedwith a change between two binary representations of JSON objects. In theembodiment of FIG. 4 , the data distribution system server 110automatically transmits binary deltas of topics to clients 106subscribed to the topic.

Initially, the data distribution system server 110 maintains 410 a topictree and information about client subscriptions to topics. Each topic ofthe topic tree can include the current binary representation of a JSONdata object. Further, in one embodiment, the information about theclient subscriptions to topics can include which clients 106 arein-session, which clients are out-of-session.

The data distribution system server can receive 412 a subscriptionrequest from at least one client 106 for subscribing to a topic of thetopic tree 116. The publisher 114 transmits 414 the current binaryrepresentation of the subscribed topic to the client 106. In anotherembodiment, the publisher 114 transmits a binary delta that allows theclient 106 to update a previous binary value representation to thecurrent binary value representation, or the publisher 114 transmits theJSON object for the topic.

The data distribution system server can receive 420 a new binaryrepresentation of a new topic state of a topic from the provider client108. Following this, the data management module calculates 422 thebinary delta between the received new binary representation and thecurrent binary representation of the current topic state. After thebinary delta is calculated, the data distribution system server updates424 the current binary value representation of topic state with the newbinary value representation of the topic state in the topic tree. Insome configurations, the replaced binary value representation, thebinary delta, and the new binary value representation can be stored bythe publisher 114. A more detailed description of the calculation ofbinary deltas is provided in the section of the description titled“IIIC. Specific Binary Delta Calculation Example.”

The publisher 114 transmits 426 the binary delta to the subscribedclients 106 via the network 180. In some configurations, the transmittedbinary delta and the current binary value representation of the topicstate can be stored by the publisher 114. This allows the datadistribution system 110 to monitor the most recently transmittedinformation to each client 106 for more efficient calculations of binarydeltas. For example, rather than providing a snapshot of the currentbinary value representation of the topic state at the client by sendingthe entire binary value representation of the topic state, the publisher114 can access the most recently transmitted binary value representationof the topic state and calculate a binary delta with the data managementmodule 118 to update the client 106 to the current binary valuerepresentation of the topic state.

The data distribution system server 110 can receive 430 an unsubscriberequest from a client 106. The unsubscribe request unsubscribes theclient 106 from the topic of the topic tree. The data distributionsystem server 110 no longer sends notifications of topic value changesto the unsubscribed client.

As discussed with FIG. 3 , in one embodiment, the data distributionsystem server 110 only sends binary deltas to clients that arein-session. The data distribution system server 110 can receive a startsession request to place the client in-session or an end-session requestto place the client out-of-session. In response to receiving the startsession request from the client, the data distribution system server canprovide the client with a snapshot of the current topic state. Inanother embodiment, a client being in-session can be associated with theconnectivity of the client to the network.

It is useful to examine the method of FIG. 4 from the perspective of theclient 106. FIG. 5 is a process flow for the client within the datadistribution environment 100 of FIG. 1 participating in a delta stream.More specifically, the deltas of FIG. 5 are binary deltas, each binarydelta associated with a change between two binary representations ofJSON objects of a topic in a topic tree 116. In the embodiment of FIG. 5, the client 106 automatically receives binary deltas from the datadistribution system server when subscribed to the topic.

The client transmits 512 a request to subscribe to a particular topic tothe data distribution system 110. In some embodiments, subscribing to atopic can result in the client receiving 514 a snapshot of the currentdata state of the topic from the topic tree 116 via the network 180. Thereceived current data state is typically a binary representation of aJSON object. In other embodiments the data state can be a JSON object,or a binary delta.

After the topic data state is updated to a new topic data state on thedata distribution system server 110, the client 106 receives 516 thebinary delta associated with the new topic data state from the datadistribution system server 110. At any point after receiving the binarydelta, the client can calculate 518 an updated binary valuerepresentation from the current binary value representation and thebinary delta. This process is described in the section titled “Updatinga Binary Value with Binary Deltas.”

At any point after calculating 518 an updated binary value, the client106 can calculate 520 the structural delta from the binary delta, thecurrent binary value, and the updated binary value. Calculatingstructural deltas is discussed in more detail in the section titled“Calculating Structural Deltas.”

Further, at any point the client can transmit 524 an unsubscribe requestto the data distribution system server. The client is unsubscribed fromthe topic and will no longer receive binary deltas when the topic stateof the unsubscribed topic changes on the data distribution system server110.

III.C Calculating Binary Deltas

Moving forward, the description turns towards specific methods forcalculating binary deltas. The data distribution system server employsan implementation of the Myers binary difference algorithm to calculatethe differences between the binary value representations of the twotopic values (i.e. binary representation of two JSON objects). The Myersdifference algorithm is described in “An O(ND) Difference Algorithm andIts Variations,” Algorithmica, 1:251-266, 1986. The result is a compactedit script describing the changes from an original binary valuerepresentation to a new binary value representation in terms of twooperations: a ‘match’ edit operation which copies a byte sequence fromthe original binary value representation and an ‘insert’ edit operationthat adds a new byte sequence. The match operation is generally includedin the binary delta as bytes indicating the position and number of thematching bytes between the two binary values. The insert operation isgenerally included in the binary delta by including the inserted bytesin the binary delta.

Additionally, there can be a ‘delete’ edit operation that describes abyte sequence removed from the original binary value representation. Thedelete operation is generally included in the binary delta as bytesindicating the position and number of the deleted bytes from a binaryvalue representation. The data distribution system server and datamanagement module may re-calculate delete edits by considering theoriginal value and the gaps between match and insert edits, such thatredundant delete edits are not included in the transmitted binary delta.In one embodiment, the data distribution system server and datamanagement module may omit delete edits from the binary delta. In thisembodiment, the client 106 can infer that the delete edit exists basedon the original binary value representation and the gaps between matchand insert edits. This is described below.

The binary delta format used by the data distribution system server 110can prevent consecutive edits of the same type, and can present edits inorder with no overlaps or gaps in the byte strings to which they refer.The following is an example of the structure edit script for thetransformation of the CBOR encoding binary values of the previousexample's old and new JSON objects, i.e. {“maps”: [“or”, “lists”], and{“maps”: [“and”, “lists”]. To begin the binary byte strings for the oldand new binary representations are:

Old Binary Representation: BF646D6170739F63616E64656C69737473FFFF

New Binary Representation: BF646D6170739F626F72656C69737473FFFF

From these two byte strings, an edit script between the old and newvalues is generated. The edit script represents how the old binary valuerepresentation should be edited to generate the new binary valuerepresentation. The edit script for this example is:

match [0,7] // “BF646D6170739F” insert “626F72” delete [7,5] match[11,8] // “656C69737473FFFF”Match [0,7] means that at byte position 0 of the old binary valuerepresentation, 7 bytes of the old binary value representation matchwith the new binary value representation. Insert “626F72” means that thebyte string of 626F72 should next be inserted into the new binaryrepresentation. Delete [7,5] means that at byte position 7 of the oldbinary value representation, 5 bytes should be deleted and nottransferred to the new binary value representation. Match [11, 8] meansthat at byte position 11 of the old value, 8 bytes match with the newbinary value representation.

To generate the binary delta, the server 110 discards the redundantdelete operation, and encodes the script using a pair of CBOR integersto represent match operations, and a CBOR byte string for insertoperations. The resulting binary delta is eight bytes long:000743626F720B08. A specific example of the generation of a binary deltais given below.

Henceforth, byte strings will use the following notation [B01, B02, B03,. . . ] where B01, B02, etc. are the bytes contained in a byte string,the bytes of the byte string are separated by ‘,’ for readability, andthe byte string is bounded by ‘[’ and ‘]’. For example, the previousbinary delta 000743626F720B08 is equivalent to [00, 07, 43, 62, 6F, 72,0B, 08]. 00 and 07 represents the operation match [0, 7]. 43 is bytecodefor an insert operation of 3 bytes, which is follows by insertion bytesof 62, 6F and 72. 0B, 08 represents operation match [11, 8]

The data distribution system server takes advantage of generating binarydeltas within a topic state streaming foundation (i.e. the datadistribution system). The data distribution system uses thisdata-oriented approach, the ability to share the cost of deltacalculations across many subscribers (i.e. clients converting binarydeltas to current values on the client), and the common occurrence ofpartially changing data streams amongst typical application use-cases tominimize system bandwidth and computational requirements of topic statestreaming.

III.D Specific Calculation Example

The above description uses two topic data states represented by JSONobjects, converts the objects into CBOR byte strings, and compares thebyte strings to generate a binary delta for transmission within the datadistribution system. The example within the sections titled ‘BinaryDelta Streams’ and ‘Calculating Binary Deltas’ are described to providean indication of a large amount of data that can be conserved by usingthis process, e.g. ˜30 bytes to 8 bytes. However, the JSON objects usedwithin the previous example are overly complex for an in depthdescription of the calculation of a binary delta. Below is a simplerexample of calculating a binary delta between two JSON objects.

FIG. 6 shows a more thorough example of the calculation of a binarydelta from binary representations of two JSON objects. Generalizing, theold JSON object is converted into bytes 610 and structured as a 15 bytestring using the CBOR format, e.g. the old byte string 620 O01→O15.Similarly, a new JSON object is converted into bytes 610 and structuredas a 17 byte string using the CBOR format, e.g. the new byte string 622N01→N17. In a more specific example, the old JSON object and the newJSON object can be:

Old Value: {“a”: [1, 2], “b”: [1, 2, 3]}

New Value: {“a”: [“xyz”, 2], “b”: [1, 2]}

Further, the byte string for the new value (i.e. new byte string 622)can be

N01 N02 N03 N04 N05 N06 N07 N08 N09 N10 N11 N12 N13 N14 N15 N16 N17 { ″a″ [ ″ x y z″, 2 ], ″ b″: [ 1 2 ] } BF 61 61 9F 63 78 79 7A 02 FF 61 629F 01 02 FF FFwhere the first row are generalized CBOR bytes (N01, N02, etc.), thesecond row is the JSON element associated with each byte, and the thirdrow is the CBOR bytes of the associated JSON element represented inhexadecimal.

Similarly, the byte string for the old value (i.e. old byte string 620)can be

O01 O02 O03 O04 O05 O06 O07 O08 O09 O10 O11 O12 O13 O14 O15 { “ a” [ 1 2], “ b”: [ 1 2 3 ] } BF 61 61 9F 01 02 FF 61 62 9F 01 02 03 FF FFwhere the first row are generalized CBOR bytes (O01, O02, etc.), thesecond row is the JSON element associated with each byte, and the thirdrow is the CBOR bytes of the associated JSON element represented inhexadecimal.

In this example, the four bytes of the old byte string 620 and new bytestring 622 match 640 and a match edit is generated for the binary delta624. Using the generalized nomenclature, bytes O01→O04 match bytesN01→N04 and the data management module produces the match edit ‘match[M01, M02]’. In some embodiments, the match edit generates two matchbytes 650 for the binary delta [M01, M02]. The first match byte, M01,indicates the position in the old byte string 620 in which the matchingbytes of the new byte string begin (e.g. 1^(st) byte, 2^(nd) byte,etc.). The second match byte, i.e. M02, indicates the number of bytesfrom the new byte string that match the old byte string (e.g. 4 bytes, 5bytes, or 10 bytes).

Using the CBOR bytes of the old and new byte strings as an example, thematch edit can be match [00, 04] and the generated bytes for the binarydelta can be [00, 04] where ‘00’ indicates the byte position where thematch begins and ‘04’ indicates that four bytes from the new byte stringmatch the old byte string.

To continue, the fifth through eighth bytes of the new byte string 622are dissimilar from the fifth byte of the old byte string 620 and aninsert edit is generated for the binary delta 624. Using the generalizednomenclature, byte O05 is dissimilar from byte N05 and the new byte 622string does not again match the old byte string 620 until bytes O06 andN09, respectively. Thus, the data management module 118 produces theinsert edit ‘insert I02, I03, I04, I05’. Note the I02→I05 are equivalentto N05→N08 in this context. In some embodiments the insert editgenerates a first insert byte, i.e. I01, to indicate that subsequentbytes, i.e. I02→I05, will be inserted from the new byte string 622 intothe old byte string 620. Therefore, the insert edit generates fiveinsert bytes 652 for the binary delta [I01, I02, I03, I04, I05]. Thedata manipulation module 118 can be configured such that the insert editbegins at the byte immediately following the most previous edit.

Using the CBOR bytes of the old and new byte strings as an example, theinsert edit can be ‘insert 6378797A’ and the generated bytes for thebinary delta can be [44, 63, 78, 79, 7A] where ‘44’ indicates that fourbytes will be inserted, and ‘63’, ‘78’, ‘79’, and ‘7A’ are the insertedbytes.

The fifth byte of the old byte string 620 is dissimilar from the fifthbyte of the new byte string 622 and a delete edit is generated for thebinary delta 624. That is, byte O05 of the old byte string 620 isdissimilar from byte N05 of the new byte string 622 and the datamanagement module 118 produces the delete edit ‘delete [D01, D02]’. Insome embodiments, the delete edit generates two bytes for the binarydelta [D01, D02]. The first delete byte, D01, indicates the position inthe old byte string 620 of the first byte to delete. The second deletebyte, D02, indicates the number of bytes from the old byte string todelete. Generally, delete edits and the corresponding bytes within thebinary delta are redundant and are removed from the binary delta (or notgenerated) and not transmitted. Removing redundant delete operationsfurther reduces the bandwidth necessary in a binary delta stream byreducing the total number of bytes in a binary delta. A concrete exampleof this will be provided below.

The sixth through twelfth bytes of the old byte string 620 match theninth through fifteenth bytes of the new byte string 622 and a matchedit is generated for the binary delta 624. Similarly to above, bytesO06→O12 match bytes N09→N15 and the data management module produces thematch edit ‘match [M03, M04]’. The match edit generates two match bytes654 for the binary delta [M03, M04].

Using the CBOR bytes of the old and new byte strings as an example, thematch edit can be match [05, 07] and the generated bytes for the binarydelta can be [05, 07] where ‘05’ indicates the beginning byte positionwhere the match begins and ‘07’ indicates that seven bytes from the newbyte string 622 match the old byte string 620. In this example, as thematch edit begins at the fifth byte, the previous delete edit thatoccurs at the fifth byte is redundant and removed from the binary delta624.

The following table represents the calculated match, insert, and deleteedit operations generating the binary delta of the above example:

Binary Delta Edit Old Value New value match [0,4] {“a”:[ {“a”:[ insert6378797A — “xyz” delete [4,1] 1 — match [5,7] 2],”b”:[1,2 2],”b”:[1,2delete [12,1] 3 — match [13,2] ]} ]}Using the above table, the data management module 118 calculates 422 thebinary delta 624 of the above example as

M01 M02 I01 I02 I03 I04 I05 M04 M05 M06 M07 00 04 44 63 78 79 7A 05 070D 02where the top row is the generalized binary delta bytes and the bottomrow are the specific values of the CBOR bytes in the binary delta 624 ofthe example. In this example, the data management module removes alldelete edit operations from the binary delta 624. In some embodiments,any or all of the delete edits may be maintained by the data managementmodule 118 and inserted into the binary delta 624.

The above example is used as a demonstration of the calculation ofmatch, insert, and delete edit operations for a binary delta stream incalculating binary deltas between topic values. Any number andarrangement of match, insert, and delete operations can generate abinary delta to modify any number of bytes in any order between the twotopic values.

III.E Updating a Binary Value with a Binary Delta

The previous example demonstrated how the data management module 118 ofthe data distribution system server calculated 422 binary deltas totransmit to clients. This section describes how the conversion module132 of a client 106 can use the binary delta 624 to calculate updatedbinary value representation from current binary value representation anda binary delta 518. Similar to before, the bytes of the old binary valuerepresentation are formed into an old byte string using the conversionmodule 132 of the client, i.e. O01→O15. Using the above genericizedbinary delta, i.e. [M01, M02, I01, I02, I03, I04, I05, M04, M05, M06,M07], the current binary value representation is updated to the newbinary value representation. The bytes of the new value can berepresented as a new byte string, i.e. N01→N17.

The first two bytes of the binary delta 624, i.e. [M01, M02], indicatethat the position (M01) and the number (M02) of bytes from the new bytestring 622 that match the old byte string 620. The matching bytes remainunchanged. The next byte of the binary delta 624, i.e. [I01], indicatesthat the next four bytes of the binary delta, i.e. [I02, I03, I04, I05],will be inserted into the old byte string 620. The next two bytes of thebinary delta 624, i.e. [M03, M04], indicate the position (M03) and thenumber (M04) of bytes from the new byte string that match the old bytestring. The matching bytes remain unchanged but are moved within the oldbyte string 620. Finally, the subsequent two bytes of the binary delta624, i.e. [M04, M05], indicate the position (M04) and the number (M05)of bytes from the new byte string 622 that match the old byte string.The matching bytes remain unchanged but are moved within the old bytestring. Generally, each new edit from the binary delta will insert,shift, rearrange, or match bytes of the old byte string 622 such thateach subsequent edit of the old byte string 622 is immediately followingthe last byte of the previous edit.

Using the specific example presented above, the byte string of the oldvalue of the topic at the client 106 is

O01 O02 O03 O04 O05 O06 O07 O08 O09 O10 O11 O12 O13 O14 O15 BF 61 61 9F01 02 FF 61 62 9F 01 02 03 FF FFand the binary delta 624 received at the client 106 is

M01 M02 I01 I02 I03 I04 I05 M04 M05 M06 M07 00 04 44 63 78 79 7A 05 070D 02

The first two bytes of the binary delta, [00, 04], indicate that theposition (00) and the number (04) of bytes from the old stream that arematching. These bytes are maintained for the new byte string, e.g. [BF,61, 61, 9F]. The next bytes of the binary delta [44] indicates that thenext four bytes of the binary delta [63, 78, 79, 7A] will be insertedinto the old byte string. In this configuration, the conversion module132 of the client 106 inserts the bytes immediately following the lastmatch edit, e.g. [BF, 61, 61, 9F, 63, 78, 79, 7A]. The next two bytes ofthe binary delta, [05, 07], indicate the position (05) and the number(07) of bytes from the new byte string that match the old byte string.The matching bytes are shifted from their original position to the endof the inserted bytes, e.g. [BF, 61, 61, 9F, 63, 78, 79, 7A, 02, FF, 61,62, 9F, 01, 02]. The next two bytes of the binary delta, [0D, 02],indicate the position (0D) and the number (02) of bytes from the newstream that match the old stream. The matching bytes are shifted fromtheir original position to the end of the previous match bytes, e.g.[BF, 61, 61, 9F, 63, 78, 79, 7A, 02, FF, 61, 62, 9F, 01, 02, FF, FF].

III.F Visualizing Binary Deltas

A visualization of this process is also shown in FIG. 6 . The match andinsert edits are shown as relationships between the new byte string andthe old byte string. The edit operations (Match 640; Insert 642; Delete644) form a spine down the middle, aligned with the relevant bytesequences of the old byte string 620 and the new byte string 622 of theold and new binary value representation.

Observe that the old binary value representation correspond to acontinuous sequence of match 640 and delete 644 edit operations, andthat the new binary value representation corresponds to a continuoussequence of match 640 and insert 642 edit operations.

IV. Structural Deltas

The description now shifts towards the calculation of structural deltas520. Structural deltas are determined at the client 106 by theconversion module 132. The conversion module 132 uses the binary delta,the received binary value representation, and the current binary valuerepresentation (i.e. the new and the old binary values representationsof the previous example, respectively) to determine a structural delta.The binary value representations are generally represented as CBOR bytestrings as previously described. The conversion module 132 employs analgorithm that uses the edit operation boundaries in the binary delta,in conjunction with two streams of token events from a structure-awareparser, to identify potential structure edits in a correspondingstructural delta.

Beginning with the initial broad example, the distributed data server110 supports the use of JavaScript Object Notation (JSON) datainterchange format to represent rich composite data. For example thefollowing two topic values (i.e. topic data states) may be representedas JSON objects:

Object 1: {“maps”: [“and”, “lists”]}

Object 2: {“maps”: [“or”, “lists”]}

The following represents a structural delta that describes thedifferences between the two JSON objects.

REMOVE { “/maps/0” : “and” } INSERT { “/maps/0” : “or” }

Additionally, returning to the more detailed example provided in thespecific calculation sections:

Old Object: {“a”: [1, 2], “b”: [1, 2, 3]}

New Object: {“a”: [“xyz”, 2], “b”: [1, 2]}

The following represents a structural delta that describes thedifferences between data structures of the old and new JSON objects.

REMOVE /a/0=1 /b/2=3 INSERT /a/0=”xyz”

Each structural delta is composed of two structural edit operationsdescribing changes to make to data structures of the old JSON object inorder to generate the structures of the new JSON object. The REMOVE editoperation contains data items removed from the old JSON object. TheINSERT edit operation contains data items added to the old JSON objectto generate the new JSON object. Each edit operation includes a a JSONPointer expression that identifies a location within a JSON datastructure, and a value for that location. In these examples, the JSONpointer/a/0 and/maps/0 appear in both the REMOVE edit operation and theINSERT edit operation to represent that the value at that location haschanged.

Structural deltas are particularly valuable where there are smallchanges to complex values. The client performs operations that returnthe parts of a structural delta that affect a given data item, allowingapplication components to filter changes of interest. Embodimentsdescribed herein can determine the structural delta from the binarydelta in a computationally efficient manner that reduces the amount oftime needed to generating the structural delta in a computer.

IV.A General Structural Delta Overview

A general description of the process for generating a structural deltaalgorithm is now provided. Two token parsers are used, one for steppingover the old binary representation of the old topic value and one forstepping over the new binary representation of the new topic value. Atoken parser is called incrementally to step forward over the bytes ofeach binary representation of the topic values, generating tokens as itidentifies parts of the corresponding JSON object of the topic value,such as strings, numbers, or structure delimiters. The sequence oftokens collectively forms the value's token stream. These parsers arereferred to below as the old token parser and the new token parser,respectively. Each token parser also maintains a stack reflecting thestructural path of the current parse position.

Each binary delta edit operation in the binary delta is evaluated inturn, parsing the new and old byte strings of the binaryrepresentations. The token found at the end of each edit represents apoint where there may be a structural difference between the two JSONobjects for the old and new values.

Tokens in the old value covered by a match edit are skipped as they alsoappear in the new value. For each token in the old value covered by adelete, a REMOVE structural edit is added. For each token in the newvalue covered by an insert, an INSERT structural edit is added.Particular care is necessary where a token covers one or more binarydelta edits. These cases of misaligned edges are called a split and aredetailed in the section title “Splits.” More simply, a split is a pointin a value's binary representation where the end of a binary delta editdoes not correspond to the end of a token in the value's token stream.

The process of matching binary deltas against the parsed tokens producesa correct but sub-optimal structural delta. False positives are commonlyreported, where the same sub-value is added to both the REMOVE andINSERT sets. This can occur if the binary delta associates binary datain some data item with structure delimiters, or because an arraycontains multiple instances of the same data item and the “wrong” onewas matched with a change in the new value. A different problem occurswhere all of the elements in a structure are reported as a sequence ofREMOVE or INSERT edits—in this instance it can be better to report asingle edit against the structure itself. Various heuristics are appliedto address these problems and improve the quality of the result and aredescribed below.

IV.B Tokens

Generally, tokens are single structural elements of a value. Forexample, a token can be a single item from a JSON object such as anumber, string, structure delimiter, or the start and end of an array.As examples, CBOR byte strings can result in the following types oftokens when parsed by a token parser:

Token Type Example JSON Syntax STRING “xyz” NUMBER 1.234 TRUE true FALSEfalse NULL null START_OBJECT { END_OBJECT } START_ARRAY [ END_ARRAY ]FIELD_NAME { ″field_name”: value}

The token stream for the old JSON object from the previously describedexample, i.e. {“a”: [1, 2], “b”: [1, 2, 3]}, having the CBOR byte string[BF, 61, 61, 9F, 01, 02, FF, 61, 62, 9F, 01, 02, 03, FF, FF] isSTART_OBJECT, FIELD_NAME, START_ARRAY, NUMBER, NUMBER, END_ARRAY,FIELD_NAME, START_ARRAY, NUMBER, NUMBER, NUMBER, END_ARRAY, END_OBJECT.In this example, there is a token for every byte except the values ofthe FIELD_NAME tokens (i.e. “a” and “b”).

IV.C Span Parsers

Each token parser is wrapped by a higher level component known as a spanparser. The span parser uses a method that reads tokens up to a targetbyte offset, reporting the result as a sequence of JSON pointerexpressions. The sequence of tokens read by a single operation is knownas a span. The span parser collapses each complete structure found in aspan to a single JSON Pointer reference to the structure. A spanfinishes at the end of the first token found after the target offset,with two exceptions. First, if a split is found in a field name, thespan continues until the next sub-value (potentially parsing oneadditional token of the token stream). This is because the same JSONpointer is used to represent a change to a field name or a change to afield's value (except if the value is a structure and the change onlyaffects part of the structure). Second, END_STRUCTURE tokens are alwaysread which can have two benefits: first, reading END_STRUCTURE tokenscollapses empty structures to the appropriate parent pointer; andsecond, the closing tokens of non-empty structures are associated withthe last pointer which simplifies edit operation boundary detection. Theimplementation relies on the span parser not reading START_STRUCTUERtokens at a split unless instructed to do so by the conversion module.This provides greater alignment between the detected binary differencesand the token parsers, allowing a simpler pairing of tokens between thetwo streams.

The two token streams are treated symmetrically. Each span parser ismoved in a forward direction; there is no backtracking. Within thismethod, based on any of: the misalignment of old and new span parserand, the current edit type, the JSON pointers associated with the parsedbytes, and the tokens parsed, the conversion module 132 can generatepotential structural edits for the structural delta.

IV.D Heuristic Clean Up

The basic approach of using the binary delta to identify splits andmatching the splits to the appropriate tokens produces a reasonable butimperfect result. Further processing is performed to improve the output.False positives (redundant REMOVE, INSERT pairs) can occur in the rawresults. These false positives are identified and removed by comparingeach potential REMOVE edit with the previous INSERT edit, and eachpotential INSERT edit with the previous REMOVE edit. If the associatedvalues are equal, and the pointers are compatible, both edits areremoved. IN another example, the raw results can contain pointers to theentire contents of a structure and are replaced with a single pointer tothe structure.

IV.E Calculating Structural Deltas

To continue FIG. 7 is a process flow for calculating a structural deltafrom a received binary delta, a current binary value, and an updatedbinary value. In one embodiment, the process flow of FIG. 7 representsstep 520 of the process flow of FIG. 5 .

To begin, the conversion module 132 generates 710 binary edit operations(i.e. match, insert, delete) from the received 516 binary delta. Thebinary edit operations are generated based on the edit boundaries withinthe binary delta byte string.

For every binary edit operation, the conversion module 132 parses 720bytes of the old binary representation into tokens in accordance withthe binary edit operation. That is, for each edit operation of thebinary delta, the conversion module 132 parses bytes of the old bytestring and generates tokens for the old token stream. Similarly, theconversion module 132 parses 730 bytes of the new binary representationinto tokens in accordance with the binary edit operation. That is, foreach edit operation of the binary delta, the conversion module 132parses bytes of the new byte string and generates tokens for the newtoken stream.

For each binary edit operation, the conversion module 132 adds 740structural edit operation(s) to the structural delta based on the tokensof the old token stream and new token stream parsed by the old spanparser and new span parser, respectively. That is, for the parsed tokensof each token stream, the span parsers compare the parsed tokens anddetermines a structural edit operation based on the type of parsedtokens and the relative location of each span parser in their respectivetoken stream. Based on any of: the misalignment of old and new spanparser and, the current edit type, the JSON pointers associated with theparsed bytes, and the tokens parsed, the conversion module 132 cangenerate potential structural edits for the structural delta. This isexplained in the section titled “Specific Structural DeltaCalculations.”

After the new and old token streams have been parsed, the structuraldelta is analyzed and the conversion module 132 removes 750 falsepositives from the structural edit operations. After the false positiveshave been removed, the remaining structural edits are output 760 as astructural delta. The structural delta represents structural changes inthe data structures from the old JSON value to the new JSON value. Insome instances the structural delta can be used by an application todetermine a new value of the topic. The new value of the topic can bedisplayed on the user interface of the client device. In someembodiments, only elements of the value that are changed by thestructural delta are updated on the display (e.g. a stock ticker, or asports score).

The structural delta can provide an application on the client devicewith a decoded representation of the change between the two binaryvalues representing the same topic. Importantly, the structural deltaalgorithm requires only a single parse of both values. The computationaltime of the algorithm scales with the size of the value and the binarydelta. The computational space required also scales as a function ofstructure depth of each value but is independent of the value size.Simply stated, larger values and larger binary deltas are more expensivecomputationally, but small values and small deltas are relativelyinexpensive computationally. Generally, the binary delta streaming usessmaller values and deltas that are more computationally efficient thanalternative algorithms for determining structural deltas.

IV.F Visualization of Structural Deltas

For ease of understanding visualization of elements of structural deltacalculations are provided. FIG. 8A illustrates a representation of theold token parser 810 parsing 720 the bytes 610 of the old token string816. The old token parser generates tokens 814 for the old token stream816. Each token 814 of the old token stream 816 can be associated withthe location of a byte 610 in the old byte string 620.

FIG. 8B illustrates a representation of the old span parser 820 andparsing 824 the tokens 814 of the old token stream 816 and the new spanparser 822 and parsing 826 the tokens 814 of the new token stream 818.The number of tokens 814 parsed depends on the current edit operation830 of the binary delta 624. In this example, four tokens from the oldtoken stream 816 are parsed reading 5 bytes of the old byte string 620and four tokens from the new token stream 818 are parsed reading 8 bytesof the new byte stream 622. Based on the token type, JSON pointers,parser location, and edit type, the span parser can generate 740potential structural edits 840 for the structural delta 842. In thisexample, the misalignment of old and new span parser 820 and 822, thecurrent edit type 830, and the JSON pointers associated with the parsedbytes, and the tokens parsed generates two potential structural editsfor the structural delta.

IV.G Specific Structural Delta Calculation Example

To continue, calculation of a structural delta from a binary delta, acurrent binary representation and an updated binary representation isexplained using the previously described old binary representation, newbinary representation, and associated binary delta. For reference thenew binary representation is:

N01 N02 N03 N04 N05 N06 N07 N08 N09 N10 N11 N12 N13 N14 N15 N16 N17 { ″a″ [ ″ x y z″, 2 ], ″ b″: [ 1 2 ] } BF 61 61 9F 63 78 79 7A 02 FF 61 629F 01 02 FF FF

The first row is generalized CBOR bytes (N01, N02, etc.), the second rowis the JSON elements associated with each byte, and the third row is theCBOR bytes of the associated JSON elements represented in hexadecimal.Parsing 720 bytes of the new byte string to generate the new byte streamfor the new value is similar to the description for the old token streamabove.

Similarly, the old binary representation is:

O01 O02 O03 O04 O05 O06 O07 O08 O09 O10 O11 O12 O13 O14 O15 { ″ a″ [ 1 2], ″ b″: [ 1 2 3 ] } BF 61 61 9F 01 02 FF 61 62 9F 01 02 03 FF FF

The first row is generalized CBOR bytes (O01, O02, etc.), the second rowis the JSON elements associated with each byte, and the third row is theCBOR bytes of the associated JSON elements represented in hexadecimal.Parsing 720 bytes of the old byte string to generate the new old streamfor the old value is described above.

The binary delta between the two values is:

M01 M02 I01 I02 I03 I04 I05 M04 M05 M06 M07 00 04 44 63 78 79 7A 05 070D 02The first row is generalized CBOR bytes (M01, I01, etc.), the bottom roware the specific values of the CBOR bytes in the binary delta of thedescribed example.

The conversion module 132 generates 710 a first binary edit operationfrom the received binary delta. For example, the first binary editoperation can be the first two bytes of the binary delta byte string,i.e. [M01, M02]. The binary edit operation can be interpreted by theconversion module 132 as a match edit ‘match [M01, M02]’. Based on thegenerated 710 binary edit operation the new/old token parsers parse 720and 730 the new/old byte string and create a new/old token stream. Oncethe match edit has been processed, based on the edit operation type, theparsed tokens of the token streams, the parsed bytes of the bytestrings, and the position of the span parsers within the token streams,the conversion module 132 adds 750 potential structural edit operationsto the structural delta.

Using the specific example from above, the first generated 710 binaryedit operation is the first two bytes of the binary delta byte string,i.e. [00, 04]. The binary edit operation can be interpreted by theconversion module 132 as a match edit ‘match [0,4]’. Based on thegenerated binary edit operation, the new/old token parsers parse 720 and730 the new/old byte string and create a new/old token stream.

In this example old span parser advances up to and including the fourthbyte of the old binary representation. The old span parser reads thefollowing four whole tokens:

Token Sub-Value START_OBJECT { FIELD_NAME “a” START_ARRAY [ NUMBER 1

Following the methodologies described above, the old span parsercontinues beyond the fourth byte and reads the first element of thearray (i.e. the number 2) processing 5 bytes of the old byte string,i.e. [BF, 61, 61, 9F, 01].

Similarly the new span parser identifies advances up to and includingthe fourth byte of the new binary representation. The new span parserreads the following four whole tokens:

Token Sub-Value START_OBJECT { FIELD_NAME “a” START_ARRAY [ STRING “xyz”

Following the methodologies described above, the old span parsercontinues beyond the fourth byte and reads the first element of thearray (i.e. the string) processing 8 bytes of the new byte string, i.e.[BF, 61, 61, 9F, 63, 78, 79, 7A].

The end of the match edit operation generated 710 from the binary deltais found and a potential REMOVE and INSERT structural edit are added tothe binary delta 740. In this example the REMOVE edit is at/a/0 with thevalue of 1 and a potential INSERT edit is at /a/0 with the value of“xyz”.

Continuing for the next edit operation, the conversion module 132 infersand generates 710 a delete edit operation from the binary delta based onadjacent edit operations and the tokens parsed by the old span parser.For example, the second binary operation can be interpreted by theconversion module 132 as a delete edit ‘delete[D01, D02]’. Delete editoperations only apply to the old token stream so the original spanparser is advanced. Tokens can be generated from the parsed 720 and 730byte streams. In this specific example, the old span parser is alreadyat the end of the delete edit operation and is not advanced. In somecases, the old span parser may advance tokens and/or read additionalbytes from the old token stream. Once the delete edit has beenprocessed, based on the parsed tokens of the token streams, the deleteedit operation, the location of the span parsers, and the parsed bytesof the byte strings the conversion module 132 adds potential structuraledits to the binary delta.

Moving forward, the conversion module 132 generates a third binary editoperation from the received binary delta. For example, the third binaryedit operation can be the subsequent five bytes of the binary delta bytestring, i.e. [I01, I02, I03, I04, I05]. The binary edit operation can beinterpreted by the conversion module 132 as an insert edit ‘insert I01,I02, I03, I04, I05’.

Insert edits only apply to the new token stream, so only the new spanparser is advanced. Tokens can be generated from the parsed 720 and 730byte streams. In this specific example, the previous match edit advancedthe new span parser to the end of the insert edit and the new spanparser is not advanced further. In some cases, the new span parser canadvance tokens and read additional bytes from the new token stream. Oncethe insert edit has been processed, based on the parsed tokens of thetoken streams, the insert edit operation, the location of the spanparsers, and the parsed bytes of the byte strings, the conversion module132 records potential structural edits.

While not explicitly described for this example, the structural edits ofthe structural delta can have any false positives removed as described.

Continuing using the described methodology through the binary delta bygenerating edit operations 710, parsing byte strings 720, and parsingtokens 730 and 740, adding structural edit operations to the delta, andremoving false positives from the delta, the outputted 760 structuraldelta for this specific example is:

REMOVE /a/0=1 /b/2=3 INSERT /a/0=”xyz”IV.H Splits

While not shown in any of the above examples, splits can occur whileparsing the token streams. The appropriate processing of a split dependson the type of edit in which the split is found. Again a split is a casein which a token covers one or more binary edit operations. More simply,a split occurs when the last parsed byte of each span parser are notaligned at an edge (i.e. at the last parsed token). For all edit types,the conversion module 132 detects and processes trailing-edge splits;i.e. splits detected at the end of a span because the end of the lastparsed token is after the end edge. The conversion module 132 alsoconsiders leading-edge splits for match edits.

Handling Split Insert Edits

In one implementation, the insert edit operation inserts all bytesidentified by the insert edit operation. If a trailing-edge split isdetected, the conversion module 132 performs one of two things. First,if the old token parser is at a split token, no further action is takenby the conversion module 132. The split will have been considered andprocessed for an earlier edit. Second, if the old token parser is not ata split token, the conversion module 132 reads the whole token of theold token stream and adds a REMOVE structural edit for the next pointerin the old stream. The old token stream is otherwise unaffected by theinsert event. Reading the whole token may advance the old token parserfurther into other delete or match edits. In all cases, for correcthandling of the insert split, the conversion module 132 adds a REMOVEstructural edit for the first JSON pointer found.

Further split detection is used to detect differences that affect thevalue structure. If the parser structure depth is different at the endof the span than it was at start, the last token is treated in the samemanner as a trailing-edge split. Without this, comparing [“b”] with[[“a”, “b”, “c”]] incorrectly generates INSERT/0, INSERT/2 rather thanINSERT/0, REMOVE/1, INSERT/1, INSERT/2. (Which heuristic-clean up willsimplify to a single REMOVE, INSERT pair).

Handling Split Delete Edits

The processing of splits for delete events is the inverse of that forinsert events. I.e. swap INSERT structural edits for REMOVE structuraledits and the new token stream and parser with the old token stream andparser.

Handling Split Match Edits

In one implementation, processing match edit splits moves both spanparsers. Either or both of the new token stream and old token stream canhave a trailing-edge split (i.e. a split before splits detected at thebeginning of a span because the beginning of the last parsed token isbefore the front edge). If neither are split, or if only one is split,the data distribution system server takes no further action. As proofsby example, two cases are presented: first, if there is a match editassociating a common subsequence of two different token streams, therewill also be insert or delete edits with trailing edge splits; andsecond, if the match edit ends with a start structure token ({ or [ inJSON), the differences in the structure content will be detected bylater edits.

Consideration of the parse depth is unnecessary for match trailing-edgesplits. A difference in structure depth must correspond to a binarydifference covering structure delimiters, which will be handled byinsert or delete processing.

There is another implementation for dealing with for match edits. Atoken in one token stream can match the end and start of two tokens inthe other stream. The conversion module 132 checks each token stream fora leading-edge split. If the new stream parser is at the expected offsetfrom the detected split, and the old stream parser is not, theconversion module has found a token spanning across two matches edits inthe old token stream. In this case, previous edit will have inserted thefirst pointer for the new stream. The conversion module 132 adds thenext found pointer in the new token stream in the match edit span (ifany). Similar processing is performed if the old stream parser is at theexpected offset, but the new stream parser is not.

The algorithm described herein employed by the data management module isscalable, requiring a single parse across both values. The parsers holdthe minimal state to track the current position within the valuestructure. The algorithm is stream-friendly in that neither the valuesnor the binary delta need to be fully realized in memory.

The given examples are meant to demonstrate functionality of thestructural delta algorithm. This algorithm can generate any structuraldelta from any two binary representations of JSON objects and the binarydelta between them. The structural delta can have any number and form ofstructural edits.

IV.I Visualization of Splits in Structural Deltas

A visualization of several possible split cases when determining astructural delta between two binary representations of JSON objects isdescribed.

FIG. 8C illustrates a first example of a split. In this example, thecurrent edit type 830 of the binary delta 624 is an insert. The insertedit has advanced the old span parser to the fifth byte 610 of the oldbyte string 620 and parsed the fourth token of the old token stream.Parsing the fourth token 814 does not necessitate advancing the old spanparser 820 further into the old byte string 620. As the last tokenparsed is the fourth token, the edge 830 of the old span parser 820 isaligned between the old token stream 816 and the old byte string 620.Additionally, the insert edit has advanced the new span parser 822forward and parsed the fourth token of the old token stream 818. Parsingthe fourth token 814 advances the new span parser 822 further into thenew byte string 620. As the last token parsed is the fourth token theedge 832 of the new span parser 822 is aligned with the fifth byte ofthe new byte string N05. As the new span parser 822 advanced in the newbyte string 822 to the seventh byte N07, the new span parser ismisaligned with the edge and there is a split. This type of split cangenerate a structural edit 840, e.g. REMOVE or INSERT, for thestructural delta 842.

FIG. 8D illustrates a second example of a split. In this example, thecurrent edit type 830 of the binary delta 624 is a match. The match edithas advanced the old span parser to the fourth byte 610 of the old bytestring 620 and parsed the third token of the old token stream. Parsingthe third token 814 advances the new span parser 822 further into thenew byte string 620 to 005. As the last token parsed is the third token,the edge 830 of the old span parser 820 is aligned with the fourth byteof the new byte string 620 O04. Additionally, the match edit hasadvanced the new span parser 822 to the fourth byte 610 of the old bytestring 620 and parsed the third token of the old token stream. Parsingthe third token 814 advances the new span parser 822 further into thenew byte string 620 to N07. As the last token parsed is the fourthtoken, the edge 832 of the new span parser 822 is aligned with the fifthbyte of the new byte string 622 N05. The span parsers are misalignedwith their respective edges and there is a split. This type of split cangenerate a structural edit 840 for the structural delta 842.

FIG. 8E illustrates a third example of a split. In this example, thecurrent edit type of the binary delta 624 is a match. The position ofthe new span parser 822 is aligned with the edge 832, i.e. the lastparsed token 814 does not advance the new span parser 822 into the newbyte string 622. The position of the old span parser 820 is misalignedwith the edge 830, i.e. the last parsed token advanced the old spanparser further into the old byte string 620. This type of split cangenerate a structural edit 840 for the structural delta 842.

These examples of FIGS. 8C-8E are illustrations of three specific splitcases. One skilled in the art will note that a split can occur at anytime that when at least one span parser is misaligned with an edge.There can be any number of cases of generating a structural delta thatcan induce a split. Each split can generate a case specific structuraledit 840 for the structural delta 842.

V. Other Considerations

FIG. 9 is a schematic diagram of a computing device for implementing aserver 110 or client device, according to one embodiment. Thecomputing-based device may be implemented as any form of a computingand/or electronic device in which embodiments of the pub/sub system maybe implemented.

The computing-based device comprises one or more processors 902 whichmay be microprocessors, controllers or any other suitable type ofprocessors for processing computer executable instructions to controlthe operation of the device in order to manage and control publish andsubscribe operations in a pub/sub system. In some examples, for examplewhere a system on a chip architecture is used, the processors 902 mayinclude one or more fixed function blocks (also referred to asaccelerators) which implement a part of the queue serving andtransmission methods in hardware (rather than software or firmware).

The computing-based device also comprises an input interface 904,arranged to receive messages relating to a topic from the publishers 114when the publishers 114 are in an external system 102, and at least onenetwork interface 906 arranged to send and receive data messages overthe communication network 180. In some examples, the input interface 904and network interface 906 can be integrated.

Computer executable instructions may be provided using anynon-transitory computer-readable media that is accessible by thecomputing based device. Non-transitory computer-readable media mayinclude, for example, computer storage media such as memory 908 andcommunications media. Computer storage media, such as memory 908,includes volatile and non-volatile, removable and non-removable mediaimplemented in any method or technology for storage of information suchas computer readable instructions, data structures, program modules orother data. Computer storage media includes, but is not limited to, RAM,ROM, EPROM, EEPROM, flash memory or other memory technology, CD-ROM,digital versatile disks (DVD) or other optical storage, magneticcassettes, magnetic tape, magnetic disk storage or other magneticstorage devices, or any other non-transmission medium that can be usedto store information for access by a computing device. Although thecomputer storage media (memory 908) is shown within the computing-baseddevice it will be appreciated that the storage may be distributed orlocated remotely and accessed via a network or other communication link(e.g. using network interface 906).

Platform software comprising an operating system 910 or any othersuitable platform software may be provided at the computing-based deviceto enable application software 912 to be executed on the device.Additional software provided at the device may include publish/subscribelogic 914 for implementing the various functions described herein. Thememory 908 can also provide a data store 918, which can be used toprovide storage for data used by the processors 902 when performing thequeue serving and transmission operations. This can include storing ofthe messages from the publishers and storing of the virtual queues.

The term ‘computer’ is used herein to refer to any device withprocessing capability such that it can execute instructions. Thoseskilled in the art will realize that such processing capabilities areincorporated into many different devices and therefore the term‘computer’ includes PCs, servers, mobile telephones, personal digitalassistants and many other devices.

Those skilled in the art will realize that storage devices utilized tostore program instructions can be distributed across a network. Forexample, a remote computer may store an example of the process describedas software. A local or terminal computer may access the remote computerand download a part or all of the software to run the program.Alternatively, the local computer may download pieces of the software asneeded, or execute some software instructions at the local terminal andsome at the remote computer (or computer network). Those skilled in theart will also realize that by utilizing conventional techniques known tothose skilled in the art that all, or a portion of the softwareinstructions may be carried out by a dedicated circuit, such as a DSP,programmable logic array, or the like.

Various other modifications, changes and variations which will beapparent to those skilled in the art may be made in the arrangement,operation and details of the method and apparatus of the presentembodiments disclosed herein without departing from the spirit and scopeof the disclosure as defined in the appended claims. Therefore, thescope of the disclosure should be determined by the appended claims andtheir legal equivalents.

What is claimed is:
 1. A method for publishing topic updates to a clientdevice, the method comprising: maintaining, at a data distributionsystem server, a data structure describing a plurality of topics towhich a client can subscribe and a publisher can publish; transmitting,to the client from the data distribution system server, a first binaryrepresentation of a data object describing a topic value of a topic ofthe plurality of topics to which the client is subscribed; andresponsive to receiving a changed topic value of the topic within thedata structure from the publisher at the data distribution server,publishing a binary delta to the client that represents a differencebetween the first binary representation and a second binaryrepresentation of a second data object describing the changed topicvalue of the topic, wherein the client is adapted to: receive thepublished binary delta, calculate the second binary representation fromthe binary delta and the first binary representation, generate astructural delta from the first binary representation, the second binaryrepresentation, and the binary delta, the structural delta representinga structural difference between data structures of the first text-basedobject and data structures of the second text-based object, anddetermine the changed topic value of the topic from the structuraldelta.
 2. The method of claim 1, further comprising: receiving, at thedata distribution system server from the client, a subscription to thetopic of the plurality of topics; and updating the data structuredescribing the plurality of topics responsive to the receivedsubscription.
 3. The method of claim 2, wherein transmitting the firstbinary representation of the first data object describing the topicvalue to the client is in response to receiving the subscription to thetopic from the client.
 4. The method of claim 1, further comprising:responsive to receiving the changed topic value of the topic from thepublisher at the data distribution server, calculating the binary deltabetween the changed topic value of the topic and the topic value of thetopic.
 5. The method of claim 1, wherein the data structure maintainedby the data distribution server comprises a topic tree maintaining topicvalues for each of the plurality of topics as binary representations,and the method further comprises: updating the topic value of the topicin the topic tree to the changed topic value using the binary delta. 6.The method of claim 1, wherein maintaining the data structure describinga plurality of topics to which the client can subscribe and thepublisher can publish further comprises: storing information detailing amost recent binary delta transmitted to the client; and storing a mostrecent binary representation of the topic value of the topic transmittedto the client.
 7. The method of claim 6, wherein a subsequent binarydelta published to the client is based on the most recent binary deltaand the most recent binary representation of the topic value of thetopic transmitted to the client.
 8. The method of claim 1, furthercomprising: receiving, at the data distribution system server from theclient, an unsubscription request to the topic of the plurality oftopics; and wherein responsive to receiving an additional changed topicvalue of the topic from the publisher at the data distribution server,the data distribution system does not publish a new binary delta to theclient that represents a difference between the additional changed topicvalue of the topic and the changed topic value of the topic.
 9. Anon-transitory computer-readable storage medium storing computer programinstructions for publishing topic updates to a client device, thecomputer program instructions executable by a computer processor toperform operations comprising: maintaining, at a data distributionsystem server, a data structure describing a plurality of topics towhich a client can subscribe and a publisher can publish; transmitting,to the client from the data distribution system server, a first binaryrepresentation of a data object describing a topic value of a topic ofthe plurality of topics to which the client is subscribed; andresponsive to receiving a changed topic value of the topic within thedata structure from the publisher at the data distribution server,publishing a binary delta to the client that represents a differencebetween the first binary representation and a second binaryrepresentation of a second data object describing the changed topicvalue of the topic, wherein the client is adapted to: receive thepublished binary delta, calculate the second binary representation fromthe binary delta and the first binary representation, generate astructural delta from the first binary representation, the second binaryrepresentation, and the binary delta, the structural delta representinga structural difference between data structures of the first text-basedobject and data structures of the second text-based object, anddetermine the changed topic value of the topic from the structuraldelta.
 10. The non-transitory computer-readable storage medium of claim9, wherein the instructions are executable to perform further operationscomprising: receiving, at the data distribution system server from theclient, a subscription to the topic of the plurality of topics; andupdating the data structure describing the plurality of topicsresponsive to the received subscription.
 11. The non-transitorycomputer-readable storage medium of claim 9, wherein transmitting thefirst binary representation of the first data object describing thetopic value to the client is in response to receiving the subscriptionto the topic from the client.
 12. The non-transitory computer-readablestorage medium of claim 9, wherein the instructions are executable toperform further operations comprising: responsive to receiving thechanged topic value of the topic from the publisher at the datadistribution server, calculating the binary delta between the changedtopic value of the topic and the topic value of the topic.
 13. Thenon-transitory computer-readable storage medium of claim 9, wherein thedata structure maintained by the data distribution server comprises atopic tree maintaining topic values for each of the plurality of topicsas binary representations, and the instructions are executable toperform further operations comprising: updating the topic value of thetopic in the topic tree to the changed topic value using the binarydelta.
 14. The non-transitory computer-readable storage medium of claim9, wherein maintaining the data structure describing a plurality oftopics to which the client can subscribe and the publisher can publishfurther comprises: storing information detailing a most recent binarydelta transmitted to the client; and storing a most recent binaryrepresentation of the topic value of the topic transmitted to theclient.
 15. The non-transitory computer-readable storage medium of claim14, wherein a subsequent binary delta published to the client is basedon the most recent binary delta and the most recent binaryrepresentation of the topic value of the topic transmitted to theclient.
 16. The non-transitory computer-readable storage medium of claim9, wherein the instructions are executable to perform further operationscomprising: receiving, at the data distribution system server from theclient, an unsubscription request to the topic of the plurality oftopics; and wherein responsive to receiving an additional changed topicvalue of the topic from the publisher at the data distribution server,the data distribution system does not publish a new binary delta to theclient that represents a difference between the additional changed topicvalue of the topic and the changed topic value of the topic.
 17. Asystem for publishing topic updates to a client device, the systemcomprising: a publisher configured to publish topic updates; a clientconfigured to subscribe to topic updates; and a data distribution servercomprising: a computer processor; and non-transitory computer-readablestorage medium storing computer program instructions, the computerprogram instructions executable by a computer processor to performoperations comprising: maintaining, at the data distribution systemserver, a data structure describing a plurality of topics to which aclient can subscribe and a publisher can publish; transmitting, to theclient from the data distribution system server, a first binaryrepresentation of a data object describing a topic value of a topic ofthe plurality of topics to which the client is subscribed; andresponsive to receiving a changed topic value of the topic within thedata structure from the publisher at the data distribution server,publishing a binary delta to the client that represents a differencebetween the first binary representation and a second binaryrepresentation of a second data object describing the changed topicvalue of the topic; and wherein the client is adapted to: receive thepublished binary delta, calculate the second binary representation fromthe binary delta and the first binary representation, generate astructural delta from the first binary representation, the second binaryrepresentation, and the binary delta, the structural delta representinga structural difference between data structures of the first text-basedobject and data structures of the second text-based object, anddetermine the changed topic value of the topic from the structuraldelta.
 18. The system of claim 17, wherein the instructions areexecutable to perform further operations comprising: receiving, at thedata distribution system server from the client, a subscription to thetopic of the plurality of topics; and updating the data structuredescribing the plurality of topics responsive to the receivedsubscription.
 19. The system of claim 17, wherein transmitting the firstbinary representation of the first data object describing the topicvalue to the client is in response to receiving the subscription to thetopic from the client.
 20. The system of claim 17, wherein theinstructions are executable to perform further operations comprising:responsive to receiving the changed topic value of the topic from thepublisher at the data distribution server, calculating the binary deltabetween the changed topic value of the topic and the topic value of thetopic.